Multi-lingual Spoken Dialog Translation System Using Transfer-driven Machine Translation

نویسندگان

  • Hideki Mima
  • Osamu Furuse
  • Yumi Wakita
چکیده

This paper describes a Transfer-Driven Machine Translation (TDMT) system as a prototype for efficient multi-lingual spoken-dialog translation. Currently, the TDMT system deals with dialogues in the travel domain, such as travel scheduling, hotel reservation, and trouble-shooting, and covers almost all expressions presented in commercially-available travel conversation guides. In addition, to put a speech dialog translation system into practical use, it is necessary to develop a mechanism that can handle the speech recognition errors. In TDMT, robust translation can be achieved by using an example-based correct parts extraction (CPE) technique to translate the plausible parts from speech recognition results even if the results have several recognition errors. We have applied TDMT to three language pairs, i.e., Japanese-English, Japanese-Korean, Japanese-German. Simulations of dialog communication between different language speakers can be provided via a TCP/IP network. In our performance evaluation for the translation of TDMT utilizing 69-87 unseen dialogs, we achieved about 70% acceptability in the JE, KJ translations, almost 60% acceptability in the EJ and JG translations, and about 90% acceptability in the JK translations. In the case of handling erroneous sentences caused by speech recognition errors, although almost all translation results end up as unacceptable translation in conventional methods, 69% of the speech translation results are improved by the CPE technique.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Example-Based Method for Transfer-Driven Machine Translation

This paper presents a method called Transfer-Driven Machine Translation (TDMT), which utilizes an example-based framework for various process and combines multi-level knowledge. An example-based framework can achieve quick processing and consistently describe knowledge. It is useful for spoken-language translation, which needs robust and efficient translation. TDMT strengthens the example-based...

متن کامل

Improving Performance of Transfer-Driven Machine Translation with Extra-Linguistic Informatioon from Context, Situation and Environment

This paper describes an improvement in the performance of Transfer-Driven Machine Translation ( T D M T ) by the use of extralinguistic information. In evaluating what constitutes natural speech, particular attention was paid to word usage that depended on the extra-linguistic information from the context, situation, environment and so on. We discuss what types of extra-linguistic information a...

متن کامل

Cross-lingual spoken language understanding from unaligned data using discriminative classification models and machine translation

This paper investigates several approaches to bootstrapping a new spoken language understanding (SLU) component in a target language given a large dataset of semantically-annotated utterances in some other source language. The aim is to reduce the cost associated with porting a spoken dialogue system from one language to another by minimising the amount of data required in the target language. ...

متن کامل

Speech{language Integration in a Multi{lingual Speech Translation System

In this paper we report on our e orts to combine speech and language processing toward multi-lingual spontaneous speech translation. The ongoing work extends our JANUS system e ort toward handling spontaneous spoken discourse and multiple languages. A major objective of this project is to maximize the number of modules, methods and data structures that are language-independent and extensible to...

متن کامل

Using Prosodic Cues in Spoken Dialog Systems

In this paper we show how prosody can be used in spoken dialog systems. First, we describe the phenomena that prosodic analysis is concerned with and give examples why prosody is relevant in the context of spoken dialog processing. Then we examine prosody in the light of pattern classification. We show how prosodic events can be categorized. We detail how those prosodic events manifest themselv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006